Tsinghua University at the Summarization Track of TAC 2008
Authors
Abstract
This paper presents our extractive summarization systems for the update summarization track of TAC 2008. We propose two novel methods: one is based on information distance theory, and the other is based on sentence centrality, which derives from the concept of centrality in graph theory. The evaluation results show that the two submitted runs are very competitive at generating extractive summaries.
Similar Resources
Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance
This paper presents our extractive summarization systems at the update summarization track of TAC 2009. This system is based on our newly developed document summarization framework under the theory of conditional information distance among many objects. The best summary is defined in this paper to be the one which has the minimum information distance to the entire document set. The best update ...
Overview of the TAC 2008 Update Summarization Task
The summarization track at the Text Analysis Conference (TAC) is a direct continuation of the Document Understanding Conference (DUC) series of workshops, focused on providing common data and an evaluation framework for research in automatic summarization. In the TAC 2008 summarization track, the main task was to produce two 100-word summaries from two related sets of 10 documents, where the secon...
IIIT Hyderabad at TAC 2008
This paper describes our participation at TAC 2008 in all three tracks. For the Summarization Track we introduced two major features. The first is a feature based on the information loss incurred if we don’t pick a particular sentence. The second is a language modeling extension that boosts novel terms and penalizes stale terms. During our post-TAC analysis we observed that a simple sentence position based summariz...
Experimenting with Clause Segmentation for Text Summarization
In this paper, we describe our experiments with clause segmentation in producing summaries for the TAC 2008 Update Summarization Track. The submitted runs were designed to determine if a heuristic clause segmentation applied before sentence selection would improve summarization results by reducing the need for sentence compression approaches. A baseline summariser was used to test this hypothes...
UofL at TAC 2008 Update Summarization and Question Answering
In this paper, we describe our update summarization and question answering (QA) systems that participated in the TAC 2008 competition. We submitted three runs for the update summarization task using unsupervised and supervised techniques. The question answering system, on the other hand, is built on our previous system, which participated in the TREC 2007 QA track, with a different approach followed for the squish...